BabyTalk: Understanding and Generating Simple Image Descriptions
Related resources
Baby Talk: Understanding and Generating Image Descriptions
We posit that visually descriptive language offers computer vision researchers both information about the world, and information about how people describe the world. The potential benefit from this source is made more significant due to the enormous amount of language data easily available today. We present a system to automatically generate natural language descriptions from images that exploi...
SentiCap: Generating Image Descriptions with Sentiments
The recent progress on image recognition and language modeling is making automatic description of image content a reality. However, stylized, non-factual aspects of the written description are missing from the current systems. One such style is descriptions with emotions, which is commonplace in everyday communication, and influences decision-making and interpersonal relationships. We design a ...
Generating Image Descriptions using Multilingual Data
In this paper we explore several neural network architectures for the WMT 2017 multimodal translation sub-task on multilingual image caption generation. The goal of the task is to generate image captions in German, using a training corpus of images with captions in both English and German. We explore several models which attempt to generate captions for both languages, ignoring the English outp...
Generating Image Descriptions Using Dependency Relational Patterns
This paper presents a novel approach to automatic captioning of geo-tagged images by summarizing multiple web documents that contain information related to an image’s location. The summarizer is biased by dependency pattern models towards sentences which contain features typically provided for different scene types such as those of churches, bridges, etc. Our results show that summaries biased b...
Constructing Simple Stable Descriptions for Image
(1) In effect, Binford [2] calls stability with respect to change in viewpoint the "assumption of general position." In this sense, general position is a special case of our notion of stability. (2) An optimal descriptive language is one that minimizes the average number of bits of description per bit of input. This will be discussed in detail shortly. (3) The inequality occurs only because of b...
Journal
Journal title: IEEE Transactions on Pattern Analysis and Machine Intelligence
Year: 2013
ISSN: 0162-8828,2160-9292
DOI: 10.1109/tpami.2012.162